Figure 2A continued

1841 CCGftGGATTG AACAAA6TTG ATGTTOCCGT IlTTTATTGCA GGAGCCAGAG AAGAAAGTGG AA2UUU!CTAC ACCACAGGCG 
> , .< .purP > 

1921 GGCGCGTGCT CAATGTGG7G GGAACTGGC6 CTACGCTA6A AGAA6CCA6A AAAGTGOCTT ACGAAAATAT OCATAAAATC 

> .i ptirD > 

GAGATCTOS» OE-P » 

BglII



2001 AATTTTGATT ATGAATATTA TCGCGAAGAC ATCGGGAAGA TATAATCTCG CTGATTTTTA ACCAAAACAT ATTTAAAAAC
»

2081 6CTTTTGTTA CTTTTATAAA CAAAGGCGTT TTTCTATTTT TGT6CCACTA TAACATGATT TAACCCATGA AAAAAATACT

2161 AAAAATACTC ATTTTTCTAC TGCTCATTCC TTGGGTTTAT GCCCT6ATTT TAATC7TTAT AAATCCACCT ATCACCATTA

2241 CACAGCT6AG CAATTTATCT TATGGTTTCT CCAGAACACA GCTCGCTTAT 6ATGAAATTC CGGCTAGT6C TAAATGGGCT

2321 GTAATTGCAG CAGAAGACCA GAATTTTGCC ATTCATAATG GCTTTGATTT TAAA6AAATT AAAACCGCCT ACGAGAAAAA

2401 CAAAGCGG6C AAGAAATTGC GTGGCGGGAG CACCCTTTCG CAACAAACTG CCAAAAATGT ATTTTTGTGG CAAGGGCGCA

2481 CTTGGAT7AG AAAAGGATTG GAAACCTACT GCACCTTTAS CATCGAAACG CTGTGGA6CA A6GAGCGTAT TTTGCAAGTT

2561 TACCTCAACA ATGCCGAAAT GGGCAAAGGC GTTTATGGCA TAGAGGCAGC GGCGCAATAT TATTTTAAGA AAAACGCCTC

2641 ACAGCTCACG CCTACCGAGA CGGCACGCAT CATTGCCTGC CTGCCCAATC CCAAAAAATA CAATKTAAAC CC6CCAAGTG

2721 CCTACATCTC AAAACGC6GA CAAIGGATTC T6CGCCAAGT GCGAAACTTG AAAOGCGAIA GGGCTCTGA6 CGAGATT6TG

2801 AACACGCCCT AACGCCTGCC TCAACTCTTT GCACACAGTT TACCAACTCT CTGCGAAGAG TTCACAAACT crrcGCACAC

2881 ACTTCCCCAA GTCTTTGCAA AGAGTT6GGA GATACTTAGG CACAAAAAAA AGGAACCTCA TGAATA6AG6 TTCCCTCTTC

2961 CTTAAAAGGA ATAAATAATA ATGTTTTTTA AGCTTTAGGC TTGGCTACTT TTTCAAAGCC TGCTGCCTTC ATGCTATCTA


HindIII


3041 G6ATACGCTT GCCTGGGCGG TAGTTTACGC CTACCTTTTT GATTAAdCCC GAATGAAAAT CTTTCTCTGT ATCTGCCGCT
« « • « «
• > R8 ■ V « ■ « V ^

3121 CCACTGCTTA AAGTGGCATA GAGC6AGCCA A6CTTATCTA AACGAAC6AT TTTGC0C6CT GCCAAGGCGT CTTGAATTAC



<R8.«AAGCITAAG 



HindIII HindIII

3201 AT7CTCTAGC GCAATGATAA CGCCAC6AAT ATCTGCCTCG CTGAGTGCCG AAAACTTCTC GATTTGCTTA ACGA6CTGGT 

3281 CTATATCCAT TTCTCCATCG CTTGCCACCA CGGCATAGZA TTTTT6TGGC TCCCCTG6CT IGCTTCGGTT TCTACGCTGA 

3361 ATTACATTGT ATTTTATGCT CATAATTACT CTATTTTTAA TAGCCTCCCG ATG6ATATAA AGTTACGCTA CAATTAGGGT 

3441 CTCCATAA6C AAATCTATAC CCCTCTCTTT CATATTCCCT TCTCATTCTT CTT6C1CCAT CTCTCAAGGC ATCCGCTCTA 

3521 TTACTGCTAT ACCCCTCCT6 AAGAAAT6TG TCTGCACTTG AAGAAGAATA TGAAGAGCTA TGAGAATCGT GCAACATAGT 

3601 CCAAGCTCCA TCTTGAGCTA TAACATTTGC ATGACATGTA ACACCTATAG TATAATAAAA TCTCCTAGGA GGTTGTGTTC 

3681 CACCACCACC TCCAGAGCTA CTACTTTTTT TACAITGTCC ATTTTG6TTA GCATGATTTT GTCC6CCATC ACTTACTAAC

3761 TTCTTAGCTT CTGCTAAGGC TTTTTCTCTT GCTTTCTTTT CAGCATCXGC TTG6CTAATT CCACTCACTG CTGTAGCTGT 

3841 CGCTTCTTTT TTATAGTTTA CCGA6GTTCC ATAATAGCCA CTACTACAAT TGTTTCT7GT AAAGTTTTTA TTAAAA6ATT 

3921 GAGTTTGTGT TGAG6TGTAC CCTCCGAAAC CTTTTACTTC TACAGTAAAG GTA6AACTCC CCAZGCTTAC GGGGAAGGTG 

4001 GCGATAGTAT ACGATTGCCC TGCCG6CATT TGTTTTACTT GATACACTCC A!rCTCCTCCC ACTTCTATGC TTGOCGTTAA 

4081 ATTACCACTA CCGCtAAAAG A6CCT7CTGC TATTTTTAGT 6TTAAATCAT TTATATCCCC 7CCTTGTCCT TTTGCAGAAG 
4161 CTTTTGTTAC ACTTACAGCA TCATAAGCCC CTTTTCCAT7 66TATAAGGT ATTTATAIGG CCAAAC 
Figure 2B continued

1681 CAACCAATTG AGAGAGAAAA TCGGTGTGHT GTTCGGTAGT CCAGAAACCA CAACGGGTGG TAATGCACTT AAATTCTftTG 



1761 CATCGGTGCG TCTAGACATT CGTCGTTCTA CTCAGATTAA AGAT6GGAAC GATGTCATCG GAAACTTGAC TCGCGTAAAA
>

1841 GTAGTGAAAA ACAAAGTAGC TCCGCCATTC CGTAGTGCAG AATTCGACAT TATGTATGGC 6AAGGAATCT CTAAAGCAG6
> . -


EcoRI


1921 CGAGA7TTTA GACATTGCTA CC6ATTTAGA AATC5TGAAA AAAAGTG6CT CTTGGTATTC TTATGCAGAT ACTAAACTA6

2001 GACAAGGGCG AGAZGCCGTG C6TGCGGTAT TGAAAGATAA TCCAGAATTA GCCGAAGAAT TAGAAGA6AA AATTAAAGAA


BglII


2081 GAATTA6A6A AAAAATA6AI TTTTTAGTTT TTTTAATTAA ACGAAAAATC CGTTCACTTT GTTGAACGGA TTTTTTTATG

2161 CTTGAATGAA TTTATTTCCA ATGGATTGAA TAGCCAIGCA CTTTTAAATC TTCGCTATCA TAAGTGATTX CTTTGTCGGT

2241 la X I. lifXu^ VTA TTT6GCAAT6 6CATGTCCT6 CGGCAATGTC CCAAAAGTTT ACAGGTCTAA

2321 AGCGGGTGTA CTCC6TAGCC CACCGATCGG CAATTAGCCC AAlGTTTGATA ACGCTTCCCA TAGGCTtlGT GC3G6AAAATT

2401 TCATGTTCGG ATTTAATTTT TTTGATGTAT TCCTCGGTGC CAGGATCCAT GTG6AAT7TG CTACAAAGAA AAGTGTAATC

2481 TTC6GGCAAA TCCATGGTAG GAATT6GCTT GCTGTGTTTC ATCAAT7GTT CAAAAAAATC CGATTTCAGA GCCATTTTGT

2561 GCAATIGITG TTGAGTCCCG ATGAATTTAC GA6AAGGGCA TTTA7CGCTA CCGAAATAGA ACAATCCAAG CGATGGGGCG


2641 TACAAAACTC CTAGCTTAGC CGTATTATTC TCAACIAAGC CTAGACACAC 6CAAXATTCA TCTGTTTTGT TGACAAAATC

2721 CATGGTGCCA TCAATAGG6T CT6CAATCCA ATAG6TGGGC GTA7TTCTAA TTTCTTGTAA AGAATCCTTA XCTCCTTCCT

2801 CACTAAAGTA TGGAATGTCT GTAAAGGAAA CATGTTTTTG CAAGATTTTG TTGGCGGCTA AATCTGCACT 7GTAACA6GC

2881 GATCC6TCGG CTTT6GTCTC 6GTGGAGAAT CC6TTTTGGA TTGTTTTAAA ACCTCTTCGC CAGCAAGTGC TACAGCCCGT

2961 GITGCGATTT CTAATAAAIT CATAATCATT CTTTTATTCT C6AACAAAGT CAAATAATTC TCTGTATTAA AAAATAATTT

3041 TGGCGATAAA AATTAAAATT TATATATAAA ATATCTCTGC AAAAAACCAA ATCAAATATT 7AGTGAAATA AAAAAAATTA

3121 GATTGTAAAT TT6CCTTATG TTTTTAGA6A ATACCATAAA TCATAGAAAA AATACGGGCT G6ATC6AAGT AATCTGTGGC

3201 TCTATGTTTT CGG6CAAAAC CGAAGAGTTG ATTCGTAGAG TGAAAC6AGC CGAATTGGCT GGGCAAAAGG TAGAAATCTT
« as


HindIII


3281 TAAACCCGCA ATTGATAAAC GCTACGATGA GCAAGATGTG GTA7CGCA7G ATGAAAACAA AAAACAAGCA ACCCCGATTG

3361 AG6CGA6TXC TAACTtGCCC ATTTTA6CAA GCGAtTGTGA TGTGGTGGGG ATAGATGA6G CXCAATTCTT TGACGAAGGA

3441 ATTGTTGAGG T6GCAAATCT TTTAGCTAAT TCGG6GAAAA GAATAATTAT TGCGGGAITFA GACATGGATT TTAAAGGTCG
« «

3521 TCCATTTGGT GCTATGOCAA ATTTAAT6GC GGTAGCGGAA TATGTGACCA AAGTGCATGC AATCTGTGT6 AAAACAG6GA